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Abstract 

Motivated by the problem of nonparametric inference in iiigh level digital 
image analysis, we introduce a general extrinsic approach for data analysis on 
Hilbert manifolds with a focus on means of probability distributions on such sam- 
ple spaces. To perform inference on these means, we appeal to the concept of 
neighborhood hypotheses from functional data analysis and derive a one-sample 
test. We then consider analysis of shapes of contours lying in the plane. By embed- 
ding the corresponding sample space of such shapes, which is a Hilbert manifold, 
into a space of Hilbert-Schmidt operators, we can define extrinsic mean shapes 
of planar contours and their sample analogues. We apply the general methods to 
this problem while considering the computational restrictions faced when utilizing 
digital imaging data. Comparisons of computational cost are provided to another 
method for analyzing shapes of contours. 

Keywords: data analysis on Hilbert manifolds, extrinsic means, nonparametric 
bootstrap, planar contours, digital image analysis, automated randomized land- 
mark selection, statistical shape analysis 

1 Introduction 

It is the purpose of this paper to introduce general methodology for data analysis on 
infinite dimensional Hilbert manifolds. Nonparametric procedures for inference on the 
population mean are included, focusing on an extrinsic approach. Theoretical results 
for both estimation and testing hypotheses are derived. Dette and Munk (1998) BOl 
rekindled the interest in neighborhood hypotheses by showing that they are appropri- 
ate and useful in a nonparametric functional context. Since "in practice" once cannot 
expect an infinite dimensional object to be exactly equal to a prescribed hypothesized 
object, the neighborhood hypothesis will be employed too in this paper. 
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Just as in finite dimensions, the general theory could be applied to a variety of 
special manifolds. However, here we will restrict ourselves to the important case of 
projective spaces that will be embedded in the Hilbert-Schmidt operators, which is 
useful in the analysis of shapes of infinite dimensional configurations lying in the plane. 
Necessary computational considerations are considered for the implementation of the 
methodology and the computational cost turns out to compare favorably with that of 
other procedures used in shape analysis. 

Let us now turn to a brief discussion of the existing literature that is most relevant 
for the present paper To the best of our knowledge, the combination of the theory 
for functional data in infinite dimensional linear spaces with the geometry for infinite 
dimensional manifolds for the purpose of nonparameteric analysis is new. There are, 
however, some procedures that seem to lack the underlying asymptotic theory needed 
for a nonparametric analysis. These will be reviewed along with some pertinent theo- 
retical results for functional data, and some relevant existing methods for finite dimen- 
sional manifolds. 

A number of statistical methodologies have been developed for the analysis of data 
lying on Hilbert spaces for the purpose of studying functional data. Some multivariate 
methods, such as PCA, have useful extensions in functional data analysis (Loeve ll33l ). 
For dense functional data, the asymptotics of the resulting eigenvalues and eigenvec- 
tors were studied by Dauxois et. al. (1982) 110] . Even for sparse functional data, these 
methods have been proved useful (see Hall et. al. (2006)lfT7l, MuUer et. al.(2006)ll39l) 
and have multiple applications. There are, nevertheless, techniques defined for multi- 
variate analysis that often fail to be directly generalizable to infinite-dimensional data, 
especially when such data is nonlinear. New methodologies have been developed to ac- 
count for these high dimensional problems, with many of these presented in a standard 
text by Ramsay and Silverman (2005) and given by references therein. For high dimen- 
sional inference on Hilbert spaces, Munk and Dette (1998) POl utilized the concept of 
neighborhood hypotheses for performing tests on nonparametric regression models. 
Following from this approach, Munk et al. (2008) BTI developed one-sample and 
multi-sample tests for population means. 

However these methods do not account for estimation of means on infinitely dimen- 
sional curved spaces, such as Hilbert manifolds. In order to properly analyze such data, 
these methods must be further generalized and modified. A key example in which such 
data arises is in the statistical analysis of direct similarity shapes of planar contours, 
which can be viewed as outlines of 2D objects in an image. 

Unlike functional data analysis, which started from dense functional data and was 
extended to sparse functional data, the study of shapes in the plane originated with D. 
G. Kendall (1984) 1231 , which showed that the space of direct similarity shapes of 
nontrivial planar finite configurations of k points is a complex projective space CP'""^ . 
However, definitions of location and variability parameters for probability distributions 
were considered much later While the full Procrustes estimate of a mean shape was 
defined by Kent (1992), a nonparametric definition of a mean shape was first introduced 
by Ziezold (1994) 1541 based upon the notion of Frechet population mean (Frechet 
(1948) 112, Ziezold (1977) El). This approach was followed by Ziezold (1998) 
Il53l . Le and Kume (2000) (22, Kume and Le (2000 El, 2003 (22), Le (2001) QUI . 
Bhattacharya and Patrangnearu (2003) JS], and Huckemann and Ziezold (2006) ifTSl . 
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While a majority of these methods have defined means in terms of Riemannian 
distances, as initially suggested by Patrangenaru (1998) p2l . a Veronese- Whitney 
(VW) extrinsic mean similarity shape was also introduced by Patrangenaru (op.cit.), 
in terms of the VW embedding of CP'^^'^ into the space S{k — 1,C) of selfadjoint 
(fc — 1) X (fc — 1) matrices. The asymptotic distribution of this extrinsic sample mean 
shape and the resulting bootstrap distribution are given in Bhattacharya and Patrange- 
naru (2005) Is), Bandulasiri et. al (2009) H and Amaral et at. (2010) HI. 

Motivated by the pioneering work of Zahn and Roskies (1972) and by other appli- 
cations in object recognition from digital images, Grenander (1993) lfT4l considered 
shapes as points on some infinite dimensional space. A manifold model for direct si- 
milarity shapes of planar closed curves, first suggested by Azencott (1994) ||2l, was 
pursued in Azencott et. al. (1996)131, and detailed by Younes (1998 EH, 1999 ISOl ). 
This idea gained ground at the turn of the millennium, with more researchers studying 
shapes of planar closed curves (eg. Sebastian et. al. (2003) 145)1 ). 

Klassen et al. (2004) (26l, Michor and Mumford (2004) lH and Younes et al. 
(2008) ED follow the methods of Small (1996) EH and Kendall by defining a Rie- 
mannian structure on a shape manifold. Klassen et al. (op. cit) compute an intrinsic 
sample mean shape, which is a Frechet sample mean for the chosen Riemannian dis- 
tance. The related papers Mio and Srivastava (2004) Mio et al. (2007) 133, 
Srivastava et al. (2005) ED, Mio et al. (2005) |l37|, Kaziska and Srivastava (2007) 
II2TI . and Joshi it et al. (2007) ||20| computean intrinsic sample mean similarity shape 
of closed curves modulo reparameterizations, for a Riemannian metric of their prefer- 
ence, out of infinitely many Riemannian metrics available. To compute these types of 
means, those papers use iterative gradient search algorithms, as closed-form solutions 
for the means do not exist. 

However, these approaches only consider computational aspects without address- 
ing fundamental definitions of populations of shapes, population means and population 
covariance operators, thus making no distinction between a population parameter and 
its sample estimators. As such, the idea of statistical inference is absent from these 
papers. 

This paper is organized as follows. In Section 2, we introduce methodology for 
data analysis on Hilbert manifolds, including a definition for the extrinsic mean (set) of 
a random object an embedded Hilbert manifold and introduce an inference procedure 
for a neighborhood hypothesis for an extrinsic mean. In Section 3, we define the space 
of direct similarity shapes of planar contours. Section 4 is dedicated to the derivation 
of the asymptotic distribution of the extrinsic sample mean contour shape. Due to the 
infinite-dimensionality, the sample mean cannot be properly studentized, so in Section 
5, we instead apply the neighborhood hypothesis test to this problem. 

The remainder of the paper concerns application of the methodology to digital 
imaging data. In section 6 we address the representation, approximation, and corre- 
spondence problems faced when working with such data in practice. In Section 7, we 
present examples of the neighborhood hypothesis test using contours extracted from a 
database of silhouettes from digital images, collected by Ben Kimia 12511 . In Section 
8, we use nonpivotal nonparametric bootstrapping (see Efron (1979) ITTl . Hall (1992) 
lfT6l ) to form confidence regions for the extrinsic mean shape for examples from the 
same database and compare computational speed of our method to that of Joshi et. 
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al. (2007) II20I . The paper ends with a discussion suggesting an extension of other 
methodologies from functional data or for data on finite dimensional manifolds to data 
analysis on Hilbert manifolds. An extension of the shape analysis methods to more 
complicated, infinite dimensional features captured in digital images, such as edge 
maps obtained from gray-level images, is also suggested. 



2 Inference for means on Hilbert manifolds 

In this section we assume that H is a separable, infinite dimensional Hilbert space over 
the reals. Any such space is isometric with I2, the space of sequences x = (.T„)„gN of 
reals for which the series Y1^=q convergent, with the scalar product < x,y > = 
J2'^=o ^nUn- A Hilbert space with the norm j|u|| = ^< v, v >, induced by the scalar 
product, becomes a Banach space. Differentiabihty can be defined with respect to this 
norm. 

DEFINITION 2.1. A function / defined on an open set [/ of a Hilbert space H is 
Frechet differentiable at a point x € U,if there is a linear operator T : H ^ H, such 
that if we set 



(1) uj^ih)^ f{x + h)- f{x)^T{h), 
then 

(2) li-^=0. 

h^O \\h\\ 



Since T in Definition l2.1l is unique, it is called the differential of / at x and is also 
denoted by dxf- 

DEFINITION 2.2. A chart on a separable metric space {Ai , p) is a one to one home- 
omorphism ip : U ^ defined on an open subset U of to a Hilbert space H. 

A Hilbert manifold is a separable metric space Ai , that admits an open covering by 
domain of charts, such that the transition maps tpv ° Vt/^ • Vu{U DV) fviU O V) 
are differentiable. 

Example 1. The projective space P(H) of a Hilbert space H, the space of all one 
dimensional linear subspaces of H, has a natural structure of Hilbert manifold mod- 
elled over H. Define the distance between two vector lines as their angle, and, given 
a line L C H, a neighborhood C/l of L can be mapped via a homeomorphism 
onto an open neighborhood of the orthocomplement by using the decomposition 
H = L ® L^. Then for two perpendicular lines Li and L2, it is easy to show that 
the transition maps (^l^ o ip^^ are differentiable as maps between open subsets in hj^, 
respectively in L^. A countable orthobasis of H and the lines L„, n G N generated by 
the vectors in this orthobasis is used to cover -P(H) with the open sets J/l,,, n G N. 
Finally, use the fact that for any line L, and H are isometric as Hilbert spaces. The 
line L spanned by a nonzero vector 7 e H is usually denoted [7] when regarded as a 
projective point on P(H) and will be denoted as such henceforth. 



4 



Similarly, one may consider complex Hilbert manifolds, modeled on Hilbert spaces 
over C. A vector space over C can be regarded as a vector space over the reals, by re- 
stricting the scalars to R; therefore any complex Hilbert manifold automatically inherits 
a structure of real Hilbert manifold. 

2.1 Extrinsic analysis of means on Hilbert manifolds 

For Hilbert spaces that do not have a linear structure, standard methods for data analysis 
on Hilbert spaces cannot directly be applied. To account for this nonlinearity, one may 
instead perform extrinsic analysis by embedding this manifold in a Hilbert space. 

DEFINITION 2.3. An embedding of a Hilbert manifold Mina Hilbert space H is 
a one-to-one differentiable function j : Ai ^ H, such that for each x G Ai, the 
differential d^j is one to one, and the range is a closed subset of H and the 

topology of A4 is induced via j by the topology of H. 

Example 2. We embed P(H) in Chs = H ® H, the space of Hilbert-Schmidt opera- 
tors of H into itself, via the Veronese-Whitney (VW) embedding j given by 

(3) jib]) = tA^I^I- 

\n\\ 

If II 7 II = 1, this definition can be reformulated as 

(4) j([7])=7®7- 

The range of this embedding is the submanifold i of rank one Hilbert-Schmidt ope- 
rators of H. 

To define a location parameter for probability distributions on a Hilbert manifold, 
the concept of extrinsic means from Bhattacharya and Patrangenaru (2003, 2005)) is 
extended below to the infinite dimensional case. 

DEFINITION 2.4. If j : ^ H is an embedding of a Hilbert manifold in a Hilbert 
space, the chord distance p on is given by p(x, y) = \\j{x) — j{y)\\, and given a 
random object X on the associated Frechet function is 

(5) F,{x) = E{\\j{X) - j{x)f). 

The set of all minimizers of Fj is called the extrinsic mean set of X. If the extrinsic 
mean set has one element only, that element is called the extrinsic mean and is labeled 
^E,j or simply ^e- 

PROPOSITION 2.1. Consider a random object X on Ad that has an extrinsic mean 
set. Then (i) i{X) has a mean vector /i and (ii) the extrinsic mean set is the set of all 
points X G Al, such that j{x) is at minimum distance from /i. (iii) In particular, jiE 
exists if there is a unique point on j(M) at minimum distance from /i, the projection 
Pj{y) of /I on j(M), and in this case ^e ~ j^^{Pj{f^))- 
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Proof. Let Y = j{X). Note that the Hilbert space H is complete as a metric space, 
therefore inf^eH E{\\Y - yf) = mmy^M E{\\Y - y\\^) < min.ye,(^) E{\\Y - yf), 
and from our assumption, it follows that inf^gn -E'dl^^ = minygn ^(11^ ^ 
is finite, which proves (i). To prove (ii), assume for is a point in the extrinsic mean 
set, and x is an arbitrary point on G Ai. From E{\\j{i^) — < E{\\j{x) — 

and, since j (i^) — fi and j{x) — fiaie constant vectors, it follows that 



It is obvious that the expected value on the extreme righthand side of equation (|6]l 
is zero We now consider a property that is critical for having a well-defined, unique 
extrinsic mean. 

DEFINITION 2.5. A random object X on a Hilbert manifold M embedded in a 
Hilbert space is j-nonfocal if there is a unique point p on j{A4) at minimum distance 
fmm E{j{X)). 

Example 3. The unit sphere 5(11) = {x e H, ||a;|| = 1} is a Hilbert manifold embed- 
ded in H via the inclusion map j : S'(IHI) H, j{x) = x. A random object X on 5(H) 
of mean ^ is j-nonfocal, if /i 7^ 0. 

Using this property, one may give an explicit formula for the extrinsic mean for a 
random object on -P(H) with respect to the VW embedding, which we will call VW 
mean. 

PROPOSITION 2.2. Assume X = [T] is a random object in P(H). Then the VW 
mean of X exists if and only if £'( plipFigjr) has a simple largest eigenvalue, in which 
case, the distribution is j-nonfocal. In this case the VW mean is he — [7], where 7 is 
an eigenvector for this eigenvalue. 

Proof. We select an arbitrary point [7] e P(H), ||7|| = 1. The spectral de- 
composition of A = i?(-pTjjr) is A = X^fcLi '^fc-^fc' '^1 — '^2 > •■• where for all 
k > l,Ek = Ck ^ efe, ||efe|| = 1, therefore if 7 = J2kLi ^k^k, J2T=i ^1 < ^1 ^en 
||j(7) - fi\\^ = II7 (8) 7IP + J2kLi 5fe - 2 < A,7 ® 7 > . To minimize this distance 
it suffices to maximize the projection of the unit vector 7 (g) 7 on A. If (5i = 62 there 
are the vectors 71 = ei and 72 = 62 are both maximizing this projection, therefore 
there is a unique point j([7]) at minimum distance from A if and only if 5i > 62- 
A definition of a covariance parameter is also needed in order to define asymptotics 
and perform inference on an extrinsic mean. The following result is a straightfor- 
ward extension of the corresponding finite dimensional result in Bhattacharya and Pa- 
trangenaru (2005). The tangential component tan{v) of u e H w.rt. the orthobasis 
ea(-P;(Ai)) € Tp.(^)j{M), a = 1, 2, • • • , c» is given by 



Then given the j-nonfocal random object AT, extrinsic mean ^e, and covariance op- 



erator of E = cov{j{X)),xf faiiiE) = rf;^(ea(Pj(M))),Va = 1, 2, . . . , then X has 



(6) 



\[m - A^ir < \\3{x) - Mil' + 2£;(< 3{x) - 3{v), fi-Y>). 



00 



(7) 
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extrinsic covariance operator represented w.rt. the basis • • • by the infinite 

matrix Sj,^: 



With extrinsic parameters of location and covariance now defined, the asymptotic dis- 
tribution of the extrinsic mean can be shown as in the final dimensional case (see Bhat- 
tacharya and Patrangenaru (2005)|l9l): 

PROPOSITION 2.3. Assume Xi, . . . are i.i.d. random objects (ro.'s) from a 
j-nonfocal distribution on a Hilbert manifold A^, for a given embedding j : — > H 
in a Hilbert space H with extrinsic mean fiE and extrinsic covariance operator E. 
Then, with probability one, for ?i large enough, the extrinsic sample mean X„ ^ is well 
defined. If we decompose i{Xn.E) — jImb) with respect to the scalar product into a 
tangential component in Tj^^j^yjiAi) and a normal component A'j(^^)j(A^), then 



where Q has a Gaussian distribution in Tj^^^j^^j{M.) with extrinsic covariance operator 



2.2 A one-sample test of the neighborhood hypothesis 

Following from a neighborhood method in the context of regression by Munk and Dette 
(1998)1401, Munk et al. (2008)01] developed tests for means of random objects on 
Hilbert spaces. We now adapt this methodology for tests for extrinsic means. First, 
however, we will need the following useful extension of Cramer's delta method. The 
proof is left to the reader 

THEOREM 2.1. For a = 1, 2 consider an embedding ja : TWa ^ of a Hilbert 
manifold Aia in a Hilbert space Ha- Assume Xi, . . . , X„ are i.i.d. ro.'s from a ji- 
nonfocal distribution on A^i for a given embedding ji : A^i — > Hi in a Hilbert space 
Hi, with extrinsic mean ji^ and extrinsic covariance operator E. Let Lp : M.i ^ M2 
be a differentiable function, such that f{Xi) is a j2-iionfocal r.o. on 7^2- Then 

(10) V^tonj2(<p(^E))02(v(-?«,£;)) - J2(v5(/^b))) ->d Y, 

where Y ^ J\f{0, c?,,£,(/3*Ed,,^(y3). Here L* is the adjoint operator of L. 

We can now define the neighborhood hypothesis procedure for tests of extrinsic 
means. Assume Ej is the extrinsic covariance operator of a random object X on the 
Hilbert manifold Ai , with respect to the embedding j : Ai ^ M. Let Mq be a compact 
submanifold of Ai . Let ipQ : Ai ^ M.he the function 



(8) 




(9) 



y/n{tan{j{Xn^E) " j(M-e))) G, 



(11) 



foip) 



min - j(po)||^ 
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and let Mg, Bg be given respectively by 

Mt = {peM,Mp)<S'}, 

(12) Bf, = {peM,Mp)^S^,}- 

Since ipo is Frechet differentiable and all small enough (5 > are regular values of ipo, 
it follows that Bq is a Hilbert submanifold of codimension one in Ai. Let i^p be the 
normal space at a points p e Bg , orthocomplement of the tangent space to Bq at p. We 
define Bg 

(13) Bg'^ = {p G Bg, Sjl^pis positive definite}. 

DEFINITION 2.6. The neighborhood test consists of testing the following two hy- 
potheses: 

i/oiMijeM^UB^^^, 

(14) HA-.fiEe {Mtr n (B^^^r- 

Munk et al. (2008) [4r show that, in general, the test statistic for these types of 
hypotheses has an asymptotically standard normal distribution for large sample sizes, 
in the case of random objects on Hilbert spaces. Here, we consider neighborhood 
hypothesis testing for the particular situation in which the submanifold Mg consists 
of a point mg on M. We set ipo — (fmo, and since Tmoi'mo} = we will prove the 
following result. 

THEOREM 2.2. If Mq = {yrig}, the test statistic for the hypotheses specified in (fT4l l 
has an asymptotically standard normal distribution and is given by: 

(15) r„ ^ ^/r^{.^™^(/t_E) - (5^}/s„, 
where 

(16) sl^A{P,SE,nl>) 

and 

I " 

(17) ®(tan. dj^^PjUiX,) - JiX)J) 
is the extrinsic sample covariance operator for {X^}"^]^, and 

Proof. The function Lpo given in equation ( fTTT ) defined on A4 can be written as 
a composite function ipo ~ ° j, where ^a{x) ~ \\x — A\\'^ is differentiable on 
H\{yl}, with the differential at x given by dx^A{y) ^ 2 < y,x ~ A > . Since j{M) 
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is a submanifold of H, the restriction (pAof^A (x) to j {A4 ) is a differentiable function, 
with the differential 

(19) dpMy) =2<y,p-A>,yye Tpj{M). 

Note that tprno {p) = 0i(mo)(j (p))j therefore, given that the differential dpj is a vector 
space isomorphism, we obtain 

(20) dp(pmo{u) = 2 < dpj{u),j{p) - j{mo) >, Vu S TpM, 
and in particular 

(21) dp^ip,no{u) = 2 < dp^j{u),j{fiE) -j{mo) >,Vit G Tp^^M, 
that is 

(22) d^j,ip„ia = 2d^^j (g) tan{j{nE) - j{mo)). 

Since the null hypothesis ( fT4l i is accepted as long as (p„i„{i^ie) < (5^, we derive the 
asymptotic distribution of ipmoifi-E) under (pmoifJ'E) — S'^- From Proposition 12.11 it 
foUows that 

(23) y/n{iprno (Ae) " Vmo (Ms)) ^rf 

where y ~ A/'(0, {dp_j^ipmo)*^Ed^EVmo)^ we see that the random variable 

has asymptotically a standard normal distribution. From equation (l22l i. if we set 

= idf,Ejy^to.n{j{fiE) - j{mo)), 

(25) cr^ = 4 <!/,!]£; 1/ >, 

then 

(26) Z„ = 

Finally we notice that v in ( fTSI l is a consistent estimator of in ( |25] | , therefore s\ in 
equation (fTSI l is a consistent estimator of in equation ( l25b and from Slutsky's the- 
orem it follows that the test statistic T„ in equation (fTSl ) has asymptotically a 7\A(0, 1) 
distribution. 



3 Similarity shape space of planar contours 

Features extracted from digital images are represented by planar subsets of unlabeled 
points. If these subsets are uncountable, the labels can be assigned in infinitely many 
ways. Here we will consider only contours, which are unlabeled boundaries of 2D 
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topological disks in the plane. To keep the data analysis stable, and to assign a unique 
labeling, we make the generic assumption that there is a unique point po on such a 
contour at the maximum distance to its center of mass so that the label of any other 
point p on the contour is the counterclockwise travel time at constant speed from po 
to p. As such, the total time needed to travel from po to itself around the contour 
once is the length of the contour Therefore we consider direct similarity shapes of 
nontrivial contours in the plane as described here. A contour 7 is then regarded as 
the range of a piecewise differentiable function, that is parameterized by arclength, i.e. 
7 : [0, L] — >■ C, 7(0) = 7(i) and is one-to-one on [0, L). Recall that the length of a 
piecewise differentiable curve 7 : [a, 5] — > is defined as follows: 



The contour 7 is said to be regular if 7 is a simple closed curve and there is a unique 
point zq — argmaxze^\\z — 

A direct similarity is a complex polynomial function in one variable of degree one. 
Two contours 71 , 72 have the same direct similarity shape if there is a direct similarity 
S* : C — ?> C, such that 8(^1) = 72 . The centered contour 70 = 7 — 2;^ ~ {z z^, z G 
7} has the same direct similarity shape as 7. 

DEFINITION 3.1. Two regular contours 71,72 have the same similarity shape if 
72,0 = A71.0, where A is a nonzero complex number 

In order to construct the space of direct similarity shapes, we note the following. 

REMARK 3.1. A function 7 : S*^ ^ C is centered if /^i ^{z)ds = 0. We consider 
regular contours since the complex vector space spanned by centered functions 7 yield- 
ing regular contours 7 is a pre-Hilbert space. Henceforth, we will be working with the 
closure of this space. This Hilbert space H can and will be identified with the space of 
all measurable square integrable centered functions from to C. 

Let be the set of all direct similarity shapes of regular contours, which is the 
same as the space of all shapes of regular contours centered at zero. 

REMARK 3.2. From Definition l3.1l and Remark lTTl we associate a unique piecewise 
differentiable curve 7 to a contour 7 by taking 7(0) = zq, the point at the maximum 
distance to the center of C, and by parameterizing 7 using arc length in the counter 
clockwise direction. Therefore Yf2^ is a dense and open subset of P(H), the projective 
space corresponding to the Hilbert space H. Henceforth, to simplify the notation, we 
will omit the symbol ~in 7 and identify a regular contour with the associated closed 
curve, without confusion. 



(27) 




and its center of mass (mean of a uniform distribution on 7) is given by 



(28) 
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4 The Extrinsic Sample Mean Direct Similarity Shape 
and its Asymptotic Distribution 

Note, from Example 2, that /'(H) is a Hilbert manifold which is VW-embedded in the 
Hilbeit space Chs, and, from Proposition |23] the VW mean ^ie of a ro. X = [F] in 
P(H), is [ei], where ei is the eigenvector corresponding to the largest eigenvalue of 

PROPOSITION 4.1. Given any VW-nonfocal probability measure Q on P(H), then 
if 7i, . . . , 7„ is a random sample from F, then, for n large enough, the VW sample 
mean fiE^n is the projective point of the eigenvector corresponding to the largest eigen- 
value of i Y.l=i ip^7» ® 

We can now derive the asymptotic distribution of /i^ „ based upon the general 
formulation specified in Proposition 12.31 The asymptotic distribution of j{X)^ is as 
follows: 

(29) - ^J.) ~^dG as rn- CX3, 

where G has a Gaussian distribution Ncnsi^' ^) ^hs ^ zero mean and covari- 
ance operator E. From Proposition 12.21 it follows that the projection Pj : Chs — ^ 
j(P(H)) c Chs is given by 

(30) Pj{A) = iyA®i^A, 

where is the eigenvector of norm 1 corresponding to the largest eigenvalue of A, 

Pjif^) = jips), and Pj{j{X)^) = j(A-E,«) ■ Applying the delta method to 
yields 

(31) MiiMn) - Nc^siO,d^P,^id^Pjf), 

as n — !> oo, where d^Pj denotes the differential, as in Definition |2.1| of the projection 
Pj , evaluated at fi. It remains to find the expression for d^Pj. To determine the formula 
for the differential, we must consider the equivariance of the embedding J. Because of 
this, we may assume withoutloss of generality that /x = diag{5^}a=i,2,3,...- As defined 
previously, the largest eigenvalue of /i is a simple root of the characteristic polynomial, 
with ei as the corresponding complex eigenvector of norm 1, where jiE = [ei]- An 
orthobasis for r[g^]P(H) is formed by ea, ita, for a = 2, 3, ... , where Ca is the eigen- 
vector over R that corresponds to the a-th eigenvalue. For any 7 which is orthogonal 
to ei w.rt. the real scalar product, we define the path iji-yit) = [cos(t)ei + sin(t)7]. 
Then Tj([ej]) j(-P(H)) is generated by the vectors tangent to such paths at t = 0. Such 
vectors have the form 7 Cgi ei + ei (X) 7. In particular, since the eigenvectors of /i are 
orthogonal w.r.t. the complex scalar product, we may take 7 = a = 2, 3, . . . , or 
7 = iea, a = 2, 3, . . . to get an orthobasis for Tj(-[gj]) j(P(H)). Normalizing these vec- 
tors to have unit lengths, we obtain the following orthonormal frame for a = 2, 3, ... : 

(32) d^j{ea) = 2-i/2(ea ®ei+ei® Ca), 

(33) df,j{iea) = i2"^/^(ea «) ei + ei (g) Ca), 
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As stated previously, since the map j is equivariant, we may assume that is a 

diagonal operator D, with the eigenvalues Sf > (5| > ... In this case, 

(34) d^,j{ea) = 2-'^^El=Fl 

(35) d^Jitea)=t2-'/^El^tF^, 

where has all entries zero except those in the positions (a, b) and {b, a) that are 
all equal to 1 . From these formulations and computations of the differential of Pj in 
the finite dimensional case in Bhattacharya and Patrangenaru (2005), it follows that 
doPjiE^) = 0, for all values a < b, except for a = 1 < 6,. In this case 

(36) doPj (F^) = ^2^^! - ^dPj i^F^) - ^J^^^i'- 

Equation ( [36b implies that the differential of the projection Pj at /i is the operator Qi 
given by 

oo ^ 

(37) ^?i = E^23a2^'^' 

k=2 1 

where (5^, . . . are the eigenvalues of E{y^T (X) F) and £'i, £'2, . . . are the corres- 
ponding eigenprojections. Also, in this situation, Q is a normally distributed random 
element in Chs ■ This results in the tangential component of the difference between 
the j - images of the VW sample mean and of the VW mean having an asymptotic 
normal distribution, albeit with a degenerate covariance operator From these compu- 
tations, the asymptotic distribution of this difference can be expressed more explicitly 
in the following manner. 

(38) \/n{tan{j{fLE,n) - i{^^E))) ^ QiG, 

where tan{v) is the tangential component of u e j(P(H)) with respect to the basis 
ea{Pj{fJ.)) € Tpj.(^)j(P(H)), for a — 2,3, . . . and is expressed as 

(39) tan{v) = {e2{PAt^)f v, e:i{P, v , , . . . )^. 

However, this result cannot be used directly because Qi, which is calculated using 
the eigenvalues of E{j^T ® F), and /ig are unknown. This problem is solved by 
estimating by jlE,n and Qi in the following manner. 

00 

(40) ^i = E|^^^' 

k=2 "1 "k 

where 81, 82, ■■ ■ are the eigenvalues of 



n ^ — ^ " - ■ " ^ 



1 ,A 1 
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and Ei,E2t ■ ■ are the corresponding eigenprojections. Using this estimation, the 
asymptotic distribution is as follows: 

(42) V^(ta^(jXAB,n) - j(Mij))) « Qi5, 

d 

where "w" denotes approximate equality in distribution and tan is the tangential com- 
ponent relative to the tangent space of jiPi^SX)) at j{fiE,n) as in ( [39l ). where /i is 
replaced with /t in equation i4T[ Applying this result to dSTT i. we arrive at the follo- 
wing. 

THEOREM 4.1. If Ti, . . . , r„ are i.i.d.ro.'s from a VW-nonfocal distribution Q on 
P(H) with VW extrinsic sample mean fiE.n, then 

(43) Vn[i{jlE,7i) ~ j{pi-E)) « <ip,^PjG as n -J> 00, 
where fin = j(-^)„ is a consistent estimator of /i. 

REMARK 4.1. It must be noted that because of the infinite dimensionality of Q, in 
practice, a sample estimate for the covariance that is of full rank cannot be found. 
Because of this issue, this result cannot be properly studentized. Rather than using a 
regularizarion technique for the covariance that leads to complicated shape data compu- 
tations, we will drastically reduce the dimensionality via the use of the neighborhood 
hypothesis methodology presented in Section 2.2. This type of approach showed its 
efficiency in projective shape analysis of planar curves in Munk et al. (2008). 

5 The One-Sample Neighborhood Hypothesis Test for 
Mean Shape 

Suppose that j : -P(H) Chs is the VW embedding in ^ and (5 > is a given 
positive number Using the notation in Section 2, we now can apply Theorem l2.2l to 
random shapes of regular contours. Assume Xr = [jr], \\lr\\ = 1, = 1, . . . , n is a 
random sample from a VW-nonfocal probability measure Q. Then equation ( |43] | shows 
that asymptotically the tangential component of the VW-sample mean around the VW- 
population mean has a complex multivariate normal distribution. Note that such a 
distribution has a Hermitian covariance matrix (see Goodman, 1963 03]), therefore in 
this setting, the extrinsic covariance operator and its sample counterpart are infinite- 
dimensional Hermitian matrices. In particular, if we extend the CLT for VW-extrinsic 
sample mean Kendall shapes in Bhattacharya and Patrangenaru (2005), to the infinite 
dimensional case, the j-extrinsic sample covariance operator SE,n, when regarded as 
an infinite Hermitian complex matrix has the following entries 

(44) SE,n,ab = n-\Sl - Sl)-\6l - 6l)-^ 

n 

< Sa^r >< eh,7r >* | < 61, 7^ > ^ , a, 6 = 2, 3, . . . 

r=l 
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with respect to the complex oithobasis 62 , 63 , 64 , . . . of unit eigenvectors in the tangent 
space T/i^ ^P(H.). Recall that this orthobasis corresponds via the differential ^ 
with an orthobasis (over C ) in the tangent space Tj^^p^^ j(P(H)), therefore one can 
compute the components of D from equation JTSl l with respect to 62, 63, 64, ... , and 
derive for in ( fT6] l the following expression 

00 

(45) si = 4 ^ SE,nMb'>''^, 

aM=2 

where SE,n.ab given in equation (l44l) are regarded as entries of a Hermitian matrix. 
The test statistic r„ in equation ( fTSI ) is defined on an infinite dimensional Hilbert ma- 
nifold. In the next section we will explain how to accurately compute approximations 
of Tn based on finite dimensional polygonal approximations of the regular contours. 

6 Approximation of Planar Contours 

Ideally, the shapes of planar contours could be studied directly. However, when per- 
forming computations, it is necessary to approximate the contour by evaluating the 
function at only a finite number of times. If k such stopping times are selected, then 
the linear interpolation of the yielded stopping points is a k-gon z, for which each 
stopping time is a vertex. As with the contour, 2 is a one-to-one piecewise differen- 
tiable function that can be parametrized by arclength. Let Lk denote the length of the 
fc-gon. For j ~ 1, . . . , fc, let z{tj) denote the jth ordered vertex, where tj G [0, Lk) 
and z{ti) = z(0) = z{Lk). It follows that, for s £ (0, 1), the fc-gon can be expressed 
as follows; 

{(i2 - sLk)z{0) + sLkz[t2) < sLk < t2 

{tj - sLk)z{tj^i) + {sLk - tj^i)z{tj) tj^i < sLk < tj 
(Lk - sLk)z{tk) + {sLk - tk)z{0) tk < sLk < Lk 

for j = 3, . . . , fc. As such, the space of direct similarity shapes of non-self-intersecting 
regular polygons is dense in the space of direct similarity shapes of regular contours. 
Therefore, the theory and methodology discussed in sections 3 through 5 hold for the 
shapes of these functions, as well. For the purposes of inference using the neighbor- 
hood hypothesis, then, it suffices to use the test statistic as derived previously. However, 
when considering these approximations, it is important to choose the stopping times 
appropriately so that the contour is well approximated by the polygon. Additionally, 
one must take correspondence across contours into consideration when working with a 
sample. We will first present an algorithm for choosing stopping times in such a way 
that the fc-gon well represents the contour and converges to it accordingly. Following 
that, we will address considerations for working with samples. 

6.1 Random Selection of Stopping Times 

To obtain approximations, we propose to randomly select a large number fc of stop- 
ping times tj from the uniform distribution over [0, L). By doing so, we insure, on 
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one hand, that we ultimately use a sufficiently large number of vertices so that the 
fc-gon well represents the contour On the other hand, we ensure the desired density 
of stopping points. In order to maintain the within sample matching for a sample of 
regular contours, we first find the point zq at the largest distance from the center of 
the contour and choose that as z{0). We then randomly select k — 1 stopping times 
from the uniform distribution to form the fc-gon, making sure to maintain the proper 
ordering, preventing the fc-gon from self-intersecting. This is accomplished by sorting 
the selected stopping times in increasing order It is important to choose an appropri- 
ate number of stopping times for the given data. The selected stopping points will be 
distributed fairly uniformly around the contour for large values of fc, ensuring that the 
curve is accurately represented by the fc-gon. However, choosing too many stopping 
times will needlessly increase the computational cost of performing calculations. This 
will be most noticeable when utilizing bootstrap techniques to compute confidence re- 
gions for the extrinsic mean shape. Choosing too few stopping times, though, while 
keeping computational cost down, can be extremely detrimental as the stopping points 
may not be sufficiently uniform to provide adequate coverage of the contour This can 
significantly distort the fc-gon, as shown in Fig[T] In this particular instance, the 200- 
gon of the dog includes no information about the lower jaw of the dog and little detail 
about one of the ears. The length of the contour L can be used to assist in determining 



Figure 1 : a 200-gon (left) and the associated contour (right) of a dog 

an appropriate number of stopping points to be chosen. After selecting an initial set of 
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k stopping times as describe above, the length Lk of the fc-gon is 

fc+i 

(47) L,=^||z(t,)-z(t,_i)||, 

where z{tk+i) = ^(0). An appropriate lower bound for the number of stopping points 
can be determined by randomly selecting times for various values of k. Compute Lk 
for each of these /c-gons using (|47] | and compute the relative error compared to L. This 
should be repeated many times to obtain a mean relative error and standard deviation 
of the relative error for each value of k used. To determine an appropriate number of 
stopping points to use, compare the mean relative error to a desired threshold. Addi- 
tionally, the distributions of the relative errors could also be examined. It should be 
noted, however, that since digital imaging data is discrete by nature, the contour will 
be represented by K pixels. As such, it is often necessary to replace L by Lk, the 
length of the closest approximation to the contour, which can be calculated similarly 
to Lk- When using this algorithm to select stopping points, it follows that the fc-gon 
will converge to the contour However, when selecting an additional stopping time tj', 
care must be taken to properly alter z in such a way that ensures that there is no self- 
intersection of the fc-gon. To do so, simply reorder the stopping times in increasing 
order and apply the resulting permutation to the stopping points. It follows that, with 
probability 1, the length of the fc-gon between successive stopping points will converge 
to as the number of stopping times tends to infinity. This can be stated more formally 
as follows. 

LEMMA 6.1. If stopping times Si , S2 , . . . , Sfe are selected from a uniform distribution 
over [0, 1), then 

Ljnax = max WzjsjLk) ~ z{s.,^iLk)\\ A 0. 

Proof. 

P{Lmax > e) = P(A11 fc stopping points are within the remaining Lk — e) 
= P [ All k stopping times are within the remaining 1 



L 



We can assume without loss of generality that the section of the fc-gon for which the 
distance between successive stopping times is greater than e* is over the interval (0, e). 
In addition, since the stopping times are independently chosen. 



e \ \ / e 



where F is the cdf for the uniform distribution over the interval [0, 1). Taking the limit 
of this expression as fc — > oo results in Lmax ^ since 

e 



□ 



lim P{L^ax > e) = lim 1 - — =0 
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It follows immediately that the center of mass of the fc-gon converges to the cen- 
ter of mass of the contour While the fc-gon z converges to the contour 7, it is also 
of great interest to consider the convergence of \z\ to [7]. Since z and 7 are objects 
in the same space, the disparity in their shapes can be examined by considering the 
squared distance — i([7])|P in Chs- However, for the purposes of computa- 

tional comparisons, it is necessary to evaluate the functions at m > fc times using ( |46] | 
and approximate the distance in ^(m, C), the space of self-adjoint m x m matrices. 
To illustrate, consider the contour considered in Figure |2] The digital representation 



of this contour consists of K = 764 pixels. Stopping times were selected using the 
above algorithm to form /c-gons for k ~ 3, . . . , 763. Each fc-gon was then evaluated 
at 764 times corresponding to the each of the pixels on the digital image of the con- 
tour As such, squared distances between the fc-gons and the contour were computed 
in 5(764, C). After this was repeated 50 times, the means and standard deviations of 
the squared distances were calculated for each value of fc and are shown in Figure |3] 
The mean squared distance to the contour converges quickly, showing that the distance 
between [zk] and [7] only diminishes slightly for fc > 100. Moreover, the variability 
introduced by selecting the stopping times randomly also rapidly approaches 0. As 
such, it is clear that [7] is well approximated using k « K. However, while the 
overall shape is well approximated, it is unclear from this alone how well the details of 
7 are approximated. As such, using the distance between shapes may not be the best 
indicator for determining a lower bound for fc. For this purpose, it may be more helpful 
to consider {Lk — Lk)/LK, the relative error in the approximation of the length, as 
described previously. For the contour in Figure |2] the relative error in length is shown 
in FigureH] Here, while the variability approaches quickly, the average relative error 
approaches at a lower rate. As such, if it is desirable to keep the relative error below 
0.05, for this example, no fewer than 300 stopping times should be selected. 

6.2 Considerations for samples of contours 

In addition to ensuring that each contour in a sample is well approximated, since each 
contour must be evaluated at m times for computations, it is necessary that each be eva- 




FigUre 2: The digital image of the contour of a stingray. 



17 



200 300 



300 400 

k 



Figure 3l The mean and standard deviation of the squared distance of shapes of fc-gons to the shape of the contour, as 
evaluated in S(764,C). 




Figure 4: The mean and standard deviation of the relative eiTor of the length of the k-g 



luated at the same times to maintain correspondence across the n observations. In the 
ideal scenario, if each contour is well approximated by a fc-gon, then select k stopping 
times si, . . . , Sfc G [0, 1) using the algorithm as described above. For j = 1, . . . ,n, the 
stopping points for Zj can then be obtained by evaluating jj at times si- U , . . . , Sk- V , 
where U denotes the length of of 7^ . Fig. |5]shows two examples of utilizing this proce- 
dure for samples of contours of hand gestures. Using the same stopping times for each 
observation, 6 stopping points are highlighted in red to illustrate the correspondence 
across the sample. Alternatively, if contour j requires kj stopping points for adequate 
approximation, where ki 7^ kj for at least one pair i, 7, then select stopping times for 
each contour. Let 7} denote the set of stopping times that generate the kj-gon Zj. In 
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Fi gUre 5 1 CoiTespondence of 6 stopping points for contours of hand gestures of (a) the number '5' and (b) the letter 'L' 



order to maintain correspondence, evaluate Zj at the m times contained in U"^j^7i for 
j = 1 , . . . , n. This approach may also be utilized if each contour is approximated using 
k stopping points, but at different times. Finally, even if the conditions of either of the 
previous scenarios are met, it may be desired to consistently work within the same 
shape space when working with multiple samples, so it may be preferred to instead 
first consider approximating the contours and then approximating each at m subse- 
quently chosen times, thus separating the issues of approximation and correspondence. 
However, for each of these scenarios, the selection of stopping points, evaluation of 
the A:-gon at m times, and subsequent analysis can be either semi-automated or fully 
automated, allowing for efficient execution of the methodology. 

6.3 Approximation of the sample mean shape 

Whenever one is dealing with an object that is conceptually of infinite, or very high, 
dimension, a suitable dimension reduction must inevitably take place to enable com- 
puters to handle this object. Because this process is usually a projection from an infi- 
nite dimensional sample space of which the original object is an element, onto a finite 
dimensional subspace, we will for convenience refer to it as a "projection". In the cur- 
rent situation, the infinite dimensional object is the average of projection operators /x 
in equation dTTI) . which is a positive element in the Hilbert space of Hilbert-Schmidt 
operators. Above, this object has been approximated by rather high-dimensional pro- 
jection and then successively by projections of lower dimension, in order to arrive at 
an approximation of sufficiently low dimension that is still a good representative of the 
original object. What constitutes "good" here has not been established rigorously, but 
instead primarily on eye ball fitting, which may, in many cases, work rather well. A 
more sophisticated approach seems possible,however, and might be based on a method 
employed in the simulation of Brownian motion to determine a suitable number of 
points at which the values of the process should be simulated (Gaines, 2012 ifTSl ). The 
objective in this special case was to ensure that the first few largest eigenvalues of the 
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covariance operator of the projection would approximate those of the original Brown- 
ian motion with prescribed accuracy. This could be achieved by using expansions for 
the eigenvalues of the projection in terms of those of the original process, known from 
perturbation theory. Since the statistic of main interest in the application considered 
in this paper is the largest eigenvalue of F, a similar approach should, in principle, be 
appropriate in the present context. However, the problem of formally approximating 
infinite dimensional objects is a topic in its own right that is beyond the scope of the 
present paper, and that should, moreover, be considered in a more general context than 
presented by the situation at hand. 

7 Application of the Neighborhood Hypothesis Test for 
Mean Shape 

The test discussed in Section 5 could be performed for a variety of applications. The 
most likely applications involve having a known extrinsic mean shape determined from 
historical data. In such cases, the hypothesis test can be used to determine whether there 
is a significant deviation from the historical mean shape. An application in agriculture 
would be determining whether the use of a new fertilizer treatment results in the extrin- 
sic mean shape of a crop significantly changing from the historical mean. Similarly, 
this test could be performed for quality control purposes to determine if there is a sig- 
nificant defect in the outline of an produced good. In practice, 6 will be determined 
by the application and the decision for a test would be reached in the standard fash- 
ion. However, for the examples presented here, there is no natural choice for 6, so one 
can instead consider setting Z = ^i_q and solving for 5 to show what decision would 
be reached for any value of 6. To do so, it is important to understand the role of 6. 
The size of the neighborhood around mo is completely determined by 6. As such, it 
follows that smaller values of 6 result in smaller neighborhoods. In terms of Hq, this 
places a greater restriction on Mg and Bs, requiring /i^; to have a smaller distance to 
niQ. For the examples presented here, the contours are approximated using k ~ 300 
stopping times, so the shape space is embedded into 5(300, C) to conduct analysis. In 
this environment, consider having two fc-gons that are identical except for at one time. 
If this exceptional point for the second fc-gon differs from the corresponding point in 
the first fc-gon by a difference of 0.01 units, then the distance between the shapes inher- 
ited from S'(300, C) is approximately 0.014L For the hypothesis test, if 6 = 0.0141, 
then the neighborhood around toq would consist of distances between shapes similar 
in scope to the situation described above. First, consider an example for which the 
one sample test for extrinsic mean shape is performed for sting ray contours. In this 
case, the sample extrinsic mean shape for a sample of contours of n = 10 sting rays is 
the shape shown on the left hand side in Fig. |6] After performing the calculations, it 
was determined that for an asymptotic level 0.05 test, the largest value of 6 for which 
we would reject the null hypothesis is 0.0290. For perspective, this neighborhood has 
a radius roughly 2 times larger than the example with the nearly identical fc-ads de- 
scribed above. This means that we would only reject the null hypothesis if we required 
the sample extrinsic mean to be nearly identical to the hypothesized mean. It should 
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Fi gUre 6; The extrinsic sample mean shape of a sample of 10 sting ray contours and, respectively, the hypothesized 
extrinsic mean shape 



also be noted here that the sample size is small here, but that the conclusion agrees with 
intuition based upon a visual inspection of the contours. Now consider two examples 
involving contours of pears. In this first case, the sample consists of n = 87 pears. The 
sample extrinsic mean shape and hypothesized extrinsic mean shape are shown in Fig 
|2] It was determined that for an asymptotic level 0.05 test, the maximum value of S 




Fi gUre 7 : The extrinsic sample mean shape of a sample of 87 pear contours and, respectively, the hypothesized extrinsic 
mean shape 

for which we would reject the null hypothesis is 1.2941. This value of S is almost 92 
times greater than the distance between the nearly identical fc-ads. This suggests that 
even if we greatly relax the constraints for similarity, the null hypothesis would still 
be rejected. This again agrees with intuition. In this last example, consider another 
sample of contours of pears. In this scenario, we consider a sample of n = 83 pears. 
The sample extrinsic mean shape and hypothesized extrinsic mean shape are shown in 
Fig |8] After performing the calculations, we determined that for an asymptotic level 
0.05 test, the largest value of 6 for which we would reject the null hypothesis is 0. 1969, 
meaning that our procedure does not reject the null hypothesis, unless 6 is smaller then 
0.1969. For perspective, this neighborhood has a radius nearly 14 times larger than 
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Fi gUre 8 '. The extrinsic sample mean shape of a sample of 83 pear contours and, respectively, the hypothesized extrinsic 
mean shape 



the example with the nearly identical fc-ads described above. Unlike in the previous 
two examples it is unclear whether the null hypothesis should be rejected in this case 
without having a specific application in mind and, as such, this could be considered a 
borderline case. 

8 Bootstrap Confidence Regions for Means of Shapes 
of Contours 

Another method for performing inference, which we consider now, is through the use 
of nonparametric nonpivotal bootstrap. By repeatedly resampling from the available 
data and computing the distance between each resampled mean and the sample mean, 
we can obtain a confidence region for the extrinsic mean shape (for the sparse case, 
see Bandulasiri et al. (2008) JH and Amaral et al (2010) lH]). The following examples 
of 95% nonparametric nonpivotal bootstrap confidence regions illustrate this approach 
using 400 resamples and serve to illustrate a methodology for visually understanding 
the confidence regions and their behavior For each example, the sample is displayed 
on the left and the 95% confidence region is displayed on the right in blue with the 
extrinsic sample mean plotted in red. The first example, shown in Fig. |9] reveals that 
the confidence regions are wider in the portions of the shape in which there is more 
variability in the sample. Here, the bands are thicker in the regions corresponding to 
the tail and the top and bottom of the front section of the stingray, where the variability 
is the greatest. Secondly, samples with less variability result in narrower confidence 
regions. This can be seen by comparing Figs. l9]and[T0l It is easy to see that there 
is less overall variability in the shapes of the contours of the wormfish than there is 
for the stingrays, which is reflected in the widths of the confidence regions. Further- 
more, the effect of sample size on the confidence regions is clearly displayed in Fig. 
nn As should be expected, the confidence region constructed using 88 observations 
is substantially thinner than that constructed using just 20 observations. In addition 
to being able to obtain sensible and intuitive results, the processing time needed to 
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Figure 9: Bootstrap 95% confidence regions for the extrinsic mean shape of stingray contours using a sample of size 
20. 




Fi gUre 10! Bootstrap 95% confidence regions for the extrinsic mean shape of wormfish contours using a sample of size 
20. 

(a) (b) 




Fi gUre 1 1 : Bootstrap 95% confidence region for the extrinsic mean shape for the pears based on (a) 20 observations 
and (b) B8 observations. 

compute bootstrap confidence regions for the VW extrinsic mean is small compared 
to doing the same using the elastic framework for the analyzing the shape of planar 
curves. The higher computational cost is due to a combination of the intrinsic analysis 
and the elastic representation. The calculation of an intrinsic mean requires the use of 
an iterative algorithm. The square-root elastic framework of Joshi et al. (2007) 1201 
adapts the algorithm of Klassen et al. (2004) for arc-length parametrized curves by in- 
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serting a reparametrization step at each iteration. This reparametrization step requires 
the use of either a dynamic programming algorithm or a gradient descent approach. 
These time-consuming steps are repeated a number of times during the calculation of 
the intrinsic mean, which, when obtaining a bootstrap confidence region, results in the 
computational cost being further compounded. As an example, this methodology was 
performed on a sample of hand gestures representing the letter "L" using the concepts 
of elastic shape representation, as described in Joshi et al. (2007) 1201 . and our metho- 
dology. The resulting confidence regions are given in Fig. [12] Using MATLAB on a 
machine running Windows XP on an Intel Core 2 Duo processor running at 2.33 GHz, 
these computations required 47.9 hours for the elastic method and only 47.5 seconds 
for ours. The difference in the size of the contours displayed in FiglT2]is due to the 
approaches using different methods for normalization. While we scale the complex 
vector denoting the coordinates for the contour at the sampled times to have a norm 
of 1, the square-root elastic framework scales curves to have unit length. While both 
methods perform well at producing estimates for mean shape and providing bootstrap 
confidence regions, our approach is far more computationally efficient. For a more de- 
tailed account of the advantages of extrinsic analysis of data on manifolds, especially 
for obtaining bootstrap confidence regions, see also Bhattacharya et al. (20 12)171. 

(a) (b) 
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Fi gure 12; Bootstrap 95% confidence regions for (a) the intrinsic mean shape, as defined by Joshi et al. (2007), and (b) 
the extrinsic mean shape of the "1" hand gesture. 



9 Discussion 

In this paper, we have described how to address the neighborhood hypothesis for one 
population mean on a Hilbert manifold. This first paper on data analysis on a Hilbert 
manifold opens up a new area for data analysis of infinite dimensional objects that can 
not be represented on a Hilbert space. This is a rich domain for further study, with 
potential extensions and techniques coming from recent advances in statistics on fi- 
nite dimensional manifolds and data analysis on infinite dimensional Hilbert spaces. 
Regarding applications in shape analysis, while our theory and computational metho- 
dology leads to the estimation of the extrinsic mean direct similarity shapes of planar 
contours, this approach could be extended further to any infinite configurations in the 
Euclidean plane, including 1 -dimensional CW-complexes and planar domains, given 



24 



that the plane is separable. For example, one may consider shapes of edge maps ob- 
tained from gray-level images. In these cases, the problem of properly matching be- 
comes much more difficult because, not only do points on a given edge from one image 
need to be matched to corresponding points on the corresponding edge in another im- 
age, but each edge in an image must be matched to the corresponding edge in another 
image, as well. 

Additionally, we have only considered inference techniques for one-sample prob- 
lems. While two-sample and multi-sample procedures may be more practical for many 
data analysis purposes, both the one-sample neighborhood hypothesis test and the 
non-pivotal nonparametric bootstrap confidence regions are useful nonparametric tech- 
niques for the estimation of the extrinsic mean shape for planar contours. In addition, 
they serve as an important first step towards the development and/or adapting of the 
desired two-sample and multi-sample procedures. 

Furthermore, it should be noted that planar contour data carries with it innate diffi- 
culties that can be challenging to account for in the data analysis described in this paper 
First, it is important to maintain a consistent camera position with respect to the object 
of interest to ensure that direct similarity shape analysis is appropriate. Otherwise, it 
may be more appropriate to consider projective shape analysis instead. 

Finally, planar contours may commonly depict images of 3D objects. In the case 
that the object is relatively flat, such as with the stingrays in the example here, a slight 
shift in the camera angle may not result in a substantial change in the contour obtained 
from the digital image. However, for other 3D objects, such as the dogs and hand 
gestures, a slight shift may result in a drastic change in the form of the associated 
contour, which is an issue that has largely been ignored in the literature. In such cases, 
neither planar direct similarity shape nor planar projective shape may be an adequate 
descriptor for analyzing the contour data. Because of these issues, it is important to 
take great care with planar shape analysis of contours that arise from images of 3D 
solid objects. In such cases, where there is an absence of additional information on 
the scenes pictured, this care can help to ensure that a meaningful analysis can be 
conducted. 
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